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1 AAAAAGAAAG GAAGAAAATG GAAATACAAC AAACACACCG CAAAATCAAT 
51 CGCCCTCTGG TTTCTCTCGC TTTAGTAGGA GCATTAGTCA GCATCACACC 
101 GCAACAAAGT CATGCCGCCT TTTTCACAAC CGTGATCATT CCAGCCATTG 
151 TTGGGGGTAT CGCTACAGGC ACCGCTGTAG GAACGGTCTC AGGGCTTCTT 
201 AGCTGGGGGC TCAAACAAGC CGAAGAAGCC AATAAAACCC CAGATAAACC 
251 CGATAAAGTT TGGCGCATTC AAGCAGGAAA AGGCTTTAAT GAATTCCCTA 
301 ACAAGGAATA CGACTTATAC AGATCCCTTT TATCCAGTAA GATTGATGGA 
351 GGTTGGGATT GGGGGAATGC CGCTAGGCAT TATTGGGTCA AAGGCGGGCA 
401 ACAGAATAAG CTTGAAGTGG ATATGAAAGA CGCTGTAGGG ACTTATACCT 
451 TATCAGGGCT TAGAAACTTT ACTGGTGGGG ATTTAGATGT CAATATGCAA 
501 AAAGCCACTT TACGCTTGGG CCAATTCAAT GGCAATTCTT TTACAAGCTA 
551 TAAGGATAGT GCTGATCGCA CCACGAGAGT GATTTCAACG CTAAAAATAT 
601 CTCAATTGAT AATTTTGCAG AAATCAACAA CTCGTGTGGG TTCTGGAGCC 
651 GGGAGGAAAG CCAGCTCTAC GGTTTTGACT TTGCAAGCTT CAGAAGGGAT 
701 CACTAGCGAT AAAAACGCTG AAATTTCTCT TTATGATGGT GCCACGCTCA 
751 ATTTGGCTTC AAGCAGCGTT AAATTAATGG GTAAT6TGTG GATGGGCCGT 
801 TTGCAATACG TGGGAGCGTA TTTGGCCCCT TCATACAGCA CGATAAACAC 
851 TTCAAAAGTA ACAGGGGAAG TGAATTTTAA CCACCTCACT GTTGGCGATA 
901 AAAACGCCGC TCAAGCGGGC ATTATCGCTA ATAAAAAGAC TAATATTGGC 
951 ACACTGGATT TGTGGCAAAG CGCCGGGTTA AACATTATCG CTCCTCCAGA 
1001 AGGTGGCTAT AAGGATAAAC CCAATAATAC CCCTTCTCAA AGTGGTGCTA 
1051 AAAACGACAA AAATGAAAGC GCTAAAAACG ACAAACAAGA GAGCAGTCAA 
1101 AATAATAGTA ACACTCAGGT CATTAACCCA CCCAATAGTG CGCAAAAAAC 
1151 AGAAGTTCAA CCCACGCAAG TCATTGATGG GCCTTTTGCG GGCGGCAAAG 
1201 ACACGGTTGT CAATATCAAC CGCATCAACA CTAACGCTGA TGGCACGATT 
1251 AGAGTGGGAG GGTTTAAAGC TTCTCTTACC ACCAATGCGG CTCATTTGCA 
1301 TATCGGCAAA GGCGGTGTCA ATCTGTCCAA TCAAGCGAGC GGGCGCTCTC 
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1351 TTATAGTG6A AAATCTAACT GGGAATATCA CCGTTGATGG GCCTTTAAGA 
1401 GTGAATAATC AAGTGGGTGG CTATGCTTTG GCAGGATCAA GCGCGAATTT 
1451 TGAGTTTAAG GCTGGTACGG ATACCAAAAA CGGCACAGCC ACTTTTAATA 
1501 ACGATATTAG TCTGGGAAGA TTTGTGAATT TAAAGGTGGA TGCTCATACA 
1551 GCTAATTTTA AAGGTATTGA TACGGGTAAT GGTGGTTTCA ACACCTTAGA 
1601 TTTTAGTGGC GTTACAGACA AAGTCAATAT CAACAAGCTC ATTACGGCTT 
1651 CCACTAATGT GGCCGTTAAA AACTTCAACA TTAATGAATT GATTGTTAAA 
1701 ACCAATGGGA TAAGTGTGGG GGAATATACT CATTTTAGCG AAGATATAGG 
1751 CAGTCAATCG CGCATCAATA CCGTGCGTTT GGAAACTGGC ACTAGGTCAC 
1801 TTTTCTCTGG GGGTGTTAAA TTTAAAGGTG GCGAAAAATT GGTTATAGAT 
1851 GAGTTTTACT ATAGCCCTTG GAATTATTTT GACGCTAGAA ATATTAAAAA 
1901 TGTTGAAATC ACCAATAAAC TTGCTTTTGG ACCTCAAGGA AGTCCTTGGG ' 
1951 GCACATCAAA ACTTATGTTC AATAATCTAA CCCTAGGTCA AAATGCGGTC 
2001 ATGGATTATA GCCAATTTTT AAATTTAACC ATTCAAGGGG ATTTCATCAA 
2051 CAATCAAGGC ACTATCAACT ATCTGGTCCG AGGTGGGAAA GTGGCAACCT 
2101 TAAGCGTAGG CAATGCAGCA GCTATGATGT TTAATAATGA TATAGACAGC 
2151 GCGACCGGAT TTTACAAACC GCTCATCAAG ATTAACAGCG CTCAAGATCT 
2201 CATTAAAAAT ACAGAACATG TTTTATTGAA AGCGAAAATC ATTGGTTATG 
2251 GTAATGTTTC TACAGGTACC AATGGCATTA GTAATGTTAA TCTAGAAGAG 
2301 CAATTCAAAG AGCGCCTAGC CCTTTATAAC AACAATAACC GCATGGATAC 
2351 TTGTGT6GT43 CGAAATACTG ATGACATTAA AGCATGCGGT ATGGCTATCG 
2401 GCGATCAAAG CATGGTGAAC AACCCTGACA ATTACAAGTA TCTTATCGGT 
2451 AAGGCATGGA AAAATATAGG GATCAGCAAA ACAGCTAATG GCTCTAAAAT 
2501 TTCGGTGTAT TATTTAGGCA ATTCTACGCC TACTGAGAAT GGTGGCAATA 
2551 CCACAAATTT ACCCACAAAC AGCACTAGCA ATGCACGTTC TGCCAACAAC 
2601 GCCCTTGCAC AAAACGCTCC TTTCGCTCAA CCTAGTGCTA CTCCTAATTT 
2651 AGTCGCTATC AATCAGCATG ATTTTGGCAC TATTGAAAGC GTGTTTGAAT 

f 
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2701 TGGCTAACCG CTCTAAAGAT ATTGACACGC TTTATGCTAA CTCAGGCGCT 
2751 CAAGGCAGGG ATCTCTTACA AACCTTATTG ATTGATAGCC ATGATGCGGG 
2801 TTATGCCAGA AAAATGATTG ATGCTACAAG CGCTAATGAA ATCACCAAGC 
2851 AATTGAATAC GGCCACTACC ACTTTAAACA ACATAGCCAG TTTAGAGCAT 
2901 AAAACCAGCG GCTTACAAAC TTTGAGCTTG AGTAATGCGA TGATTTTAAA 
2951 TTCTCGTTTA GTCAATCTCT CCAGGAGACA CACCAACCAT ATTGACTCGT 
3001 TCGCCAAACG CTTACAAGCT TTAAAAGACC AAAAATTCGC TTCTTTAGAA 
3051 AGCGCGGCAG AAGTGTTGTA TCAATTTGCC CCTAAATATG AAAAACCTAC 
3101 CAATGTTTGG GCTAACGCTA TTGGGGGAAC GAGCTTGAAT AATGGCTCTA 
3151 ACGCTTCATT GTATGGCACA AGCGCGGGCG TAGACGCTTA CCTTAACGGG 
3201 CAAGTGGAAG CCATTGTGGG CGGTTTTGGA AGCTATGGTT ATAGCTCTTT 
3251 TAATAATCGT GCGAACTCCC TTAACTCTGG GGCCAATAAC ACTAATTTTG 
3301 GCGTGTATAG CCGTATTTTA ACCAACCAGC ATGAATTTGA CTTTGAAGCT 
3351 CAAGGGGCAC TAGGGAGCGA TCAATCAAGC TTGAATTTCA AAAGCGCTCT 
3401 ATTACAAGAT TTGAATCAAA GCTATCATTA CTTAGGCTAT AGCGCTGCAA 
3451 CAAGAGCGAG CTATGGTTAT GACTTCGCGT TTTTTAGGAA CGCTTTAGTG 
3501 TTAAAACCAA GCGTGGGTGT GAGCTATAAC CATTTAGGTT CAACCAACTT 
3551 TAAAAGCAAC AGCACCAATC AAGTGGCTTT GAAAAATGGC TCTAGCAGTC 
3601 AGCATTTATT CAACGCTAGC GCTAATGTGG AAGCGCGCTA TTATTATGGG 
3651 GACACTTCAT ACTTCTACAT GAATGCTGGA GTTTTACAAG AGTTCGCTCA 
3701 TGTTGGCTCT AATAACGCCG CGTCTTTAAA CACCTTTAAA GTGAATGCCG 
3751 CTCGCAACCC TTTAAATACC CATGCCAGAG TGATGATGGG TGGGGAATTA 
3801 AAATTAGCTA AAGAAGTGTT TTTGAATTTG GGCGTTGTTT ATTTGCACAA 
3851 TTTGATTTCC AATATAGGCC ATTTCGCTTC CAATTTAGGA ATGAGGTATA 
3901 GTTTCTAAAT ACCGCTCTTA AACCCATGCT CAAAGCATGG GTTTGAAATC 
3951 TTACAAAACA 
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1 MEIQQTHRKI NRPLVSLALV GALVSITPQQ SHAAFFTTVI IPAIVGGIAT 
51 GTAVGTVSGL LSWGLKQAEE ANKTPDKPDK VWRIQAGKGF NEFPNKEYDL 
101 YRSLLSSKID GGWDWGNAAR HYWVKGGQQN KLEVDMKDAV GTYTLSGLRN 
151 FTGGDLDVNM QKATLRLGQF NGNSFTSYKD SADRTTRVIS TLKISQLI IL 
201 QKSTTRVGSG AGRKASSTVL TLQASEGITS DKNAEISIYD GATLNLASSS 
251 VKLMGNVWMG RLQYVGAYLA PSYSTINTSK VTGEVNFNHL TVGDKNAAQA 
301 GI IANKKTNI GTLDLWQSAG LNIIAPPEGG YKDKPNNTPS QSGAKNDKNE 
351 SAKNDKQESS QNNSNTQVIN PPNSAQKTEV QPTQVIDGPF AGGKDTVVNI 
401 NRINTNADGT IRVGGFKASL TTNAAHLHIG KGGVNLSNQA SGRSLIVENL 
451 TGNITVDGPL RVNNQVGGYA LAGSSANFEF KAGTDTKNGT ATFNNDISLG 
501 RFVNLKVDAH TANFKGIDTG NGGFNTLDFS GVTDKVNINK LITASTNVAV 
551 KNFNINELIV KTNGISVGEY THFSEDIGSQ SRINTVRLET GTRSLFSGGV 
601 KFKGGEKLVI DEFYYSPWNY FDARNIKNVE ITNKLAFGPQ GSPWGTSKLM 
651 FNNLTLGQNA VMDYSQFLNL TIQGDFINNQ GTINYLVRGG KVATLSVGNA 
701 AAMMFNNDID SATGFYKPLI KINSAQDLIK NTEHVLLKAK I IGYGNVSTG 
751 TNGISNVNLE EQFKERLALY NNNNRMDTCV VRNTDDIKAC GMAIGDQSMV 
801 NNPDNYKYLI GKAWKNIGIS KTANGSKISV YYLGNSTPTE NGGNTTNLPT 
851 NTTSNARSAN NALAQNAPFA QPSATPNLVA INQHDFGTIE SVFELANRSK 
901 DIDTLYANSG AQGRDLLQTL LIDSHDAGYA RKMIDATSAN EITKQLNTAT 
951 TTLNNIASLE HKTSGLQTLS LSNAMILNSR LVNLSRRHTN HIDSFAKRLQ 
1001 ALKDQKFASL ESAAEVLYQF APKYEKPTNV WANAIGGTSL NNGSNASLYG 
1051 TSAGVDAYLN GQVEAIVGGF GSYGYSSFNN RANSLNSGAN NTNFGVYSRI 
1101 LTNQHEFDFE AQGALGSDQS SLNFKSALLQ DLNQSYHYLA YSAATRASYG 
1151 YDFAFFRNAL VLKPSVGVSY NHLGSTNFKS NSTNQVALKN GSSSQHLFNA 
1201 SANVEARYYY GDTSYFYMNA GVLQEFAHVG SNNAASLNTF KVNAARNPLN 
1251 THARVMMGGE LKLAKEVFLN LGWYLHNLI SNIGHFASNL GMRYSF 
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CTCCATTTTAA6CAACTCCATA6ACCACTAAA6AAACTTTTTTTGA66CTATCTTTGAAA 
6CTTAATTATACATGCTATAGTAAGCAT6ACACACAAACCAAACTATTTTTA6AAC6CTT 
TCAAAAAGATTCATTTCTTATTTCTTGTTCTTATTAAAGTTCTTTCATTTTAGCAAATTT 
CTTTTTTCAATATTAATAATGATTAATGAAAAAAAAAAAAAATGCTTGATATTGTTGTAT 
TTGACACTAACAAGATACCGATAGGTATGAAACTAGGTATAGTA AGGAG AAACAATGArT 

M T 

AATAATCTTCAAGTAGCTTTTCTTAAAGTTGATAACGCTGTCGCTTCATACGATCCTGAT 
23NNLQVAFLKVDNAVASYDPD 

CAATTAAGGGAAGAATACTCCAATAAAGCGATCAAAAATCCTACCAAAAAGAATCAGTAT 
63QLREEYSNKA I KNPTKKNQY 

GAATCTTCCACAAAGAGCTTTCAGAAATTTGGGGATCAGCGTTACCGAATTTTCACAAGT 
103 ESSTKSFQKFGDQRYRIFTS 

GAAAATATCATACAACCCCCTATCCTTGATGATAAAGAGAAAGCGGAGTTTTTGAAATCT 
143 ENI IQPP I LDDKEKAEFLKS 

ATGGGCGTGTTTGATGAGTCCTTGAAAGAAAGGCAAGAAGCAGAAAAAAATGGAGAGCCT 
183 MGVFDESLKERQEAEKNGEP 

GATGTCAAAGAAGCAATCAATCAAGAACCAGTTCCCCATGTCCAACCAGATATAGCCACT 
223 DVKEA I NQEPV PHVQPDIAT 

AATTTTTCTAAATTCACTCTTGGCGATATGGAAATGTTAGATGTTGAGGGAGTCGCTGAC 
263 NFSKFTLGDME MLDVEGVAD 

TTAATGGGGAGTCATAATGGCATAGAACCTGAAAAAGTTTCATTGTTGTATGGGGGCAAT 
303 1 MGSHNG I EPEKVSLLYGGN 

AACAATGTGGCTACAATAATTAATGTGCATATGAAAAACGGCAGTGGCTTAGTCATAGCA 
313 U V A T I INVHMKNGSGLVIA 

GGCTCACAACGAGCATTAAGTCAAGAAGAGATCCAAAACAAAATAGATTTCATGGAATTT 
383 GSQRALSQEEIQNKI DFMEF 

ACTGAGATTAAAGATTTCCAAAAAGACTCTAAGGCTTATTTAGACGCCCTAGGGAATGAT 
423 TEIKDFQKDSKAYLDALGND 

AATGGGGATTTGAGCTACACTCTCAAAGATTATGGGAAAAAAGCAGATAAAGCTTTAGAT 
463 NGDL S YT LKDYGKKA D K A LD 

TATTCTAATTTCAAATACACCAACGCCTCCAAGAATCCCAATAAGGGTGTAGGCGTTACG 
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ATCTGTCCTATT6ATTTGTTTTCCATTTTGTTTCCCATGTGGATCTT6T66ATCACAAAC 120 
CATGTGCTCACCTTGACTAACCATTTCTCCAACCATACTTTAGCGTTGCATTTGATTTCT 240 
TTGTTAATTGTGGGTAAAAATGTGAATCGTCCTAGCCTTTAGACGCCTGCAACGATCGGG 360 
AATGAGAATGTTCAAAGACATGAATTGACTACTCAAGCGTGTAGC6ATTTTTAGCAGTCT H80 
AACGAAACCATTGACCAACAACCACAAACCGAAGCGGCTTTTAACCCGCAGCAATTTATC 600 
NETIDQQPQTEAAFNPQQFI 
CAAAAACCAATCGTTGATAAGAACGATAGGGATAACAGGCAAGCTTTTGAAGGAATCTCG 720 
QKPIVDKNDRDNRQAFEGIS 
TTTTCAGACTTTATCAATAAGAGCAATGATTTAATCAACAAAGACAATCTCATTGATGTA 840 
FSDFINKSNDLINKDNLIDV 
TGGGTGTCCCATCAAAACGATCCGTCTAAAATCAACACCCGATCGATCCGAAATTTTATG 960 
WVSHQNDPSKINTRSI RNFM 
GCCAAACAATCTTTTGCAGGAATCATTATAGGGAATCAAATCCGAACGGATCAAAAGTTC 1080 
AKQSFAGI I IGNQIRTDQKF 
ACTGGTGGGGATTGGTTGGATATTTTTCTCTCATTTATATTTGACAAAAAACAATCTTCT 1200 
TGGDWLDIFLSFIFDKKQSS 
ACCACCACCGACATACAAGGCTTACCGCCTGAAGCTAGAGATTTACTTGATGAAAGGGGT 1320 
TTTDIQGLPPEARDLLDERG 
ATTGATCCCAATTACAAGTTCAATCAATTATTGATTCACAATAACGCTCTGTCTTCTGTG 1440 
I DPNYKFNQLL I HNNALS SV 
GGTGGTCCTGGAGCTAG6CATGATTGGAACGCCACCGTTGGTTATAAAGACCAACAAGGC 1560 
GGPGARHDWNATVGYKDQQG 
GGTGGTGAGAAAGGGATTAACAACCCTAGTTTTTATCTCTACAAAGAAGACCAACTCACA 1680 
GGEKG I NNPSF YLYKEDQLT 
CTTGCACAAAATAATGCTAAATTAGACAACTTGAGCGAGAAAGAGAAGGAAAAATTCCGA 1800 
LAQNNAKLDNLSEKEKEKFR 
CGTATTGCTTTTGTTTCTAAAAAAGACACAAAACATTCAGCTTTAATTACTGAGTTTGGT 1920 
R I AFVSKKDTKHSAL I TEFG 
AGGGAGAAAAATGTTACTCTTCAAGGTAGCCTAAAACATGATGGCGTGATGTTTGTTGAT 2040 
REKNVTLQGSLKHDGVMFVD 
AATGGCGTTTCCCATTTAGAAGTAGGCTTTAACAAGGTAGCTATCTTTAATTTGCCTGAT 2160 
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503 YSNFKYTNASKNPNKGVGVT 

TTAAATAATCTC6CTATCACTAGTTTCGTAA66C66AATTTAGA6GATAAACTAACCACT 
543 L N N L A ITSFVRRNLCDKLTT 

GAATTGGTTGGAAAAACTTTAAACTTGAATAAAGCTGTAGC1GACGCTAAAAACACAGGC 
583 ELVGKTLN FNKAVADAKNTG 

CATTTAGAGAAAGAAGTAGAGAAAAAATTGGAGAGCAAAAGCGGCAACAAAAATAAAATG 
623 HLEKEVEKKLESKS6NKNKM 

GCTAATAGAGACGCAAGAGCAATCGCTTACGCTCAGAATCTTAAAGGCATCAAAAGGGAA 
663 ANRDARA I AYAQNLKGIKRE 

GAATTCAAAAATGGCAAAAATAAGGATTTCAGCAA GGCAGAAGAAACACTAAAAGCCCTT 
703 IEFKNGKNKDF S j<| AEETLKAL 

AATGCAGCTTTGAA TGAATTCAAAAATGGCAAAAATAAGGATTTCAGCAA GGTAACGCAA 
743 N A A L N |E FKNGKNKDFS"k) V T Q 

AAAGTTGATAATCTCAATCAAGCGGTATCAGTGGCTAAAGCAACGGGTGATTTCAGTAGG 
783 KVDNLNOAVSVAKATGDF SR 

CAAAAAAATGAAAGTCTCAATGCTAGAAAAAAATCTGAAATATATCAATCCGTTAAGAAT 
823 QKNESLNARKKSEIYQSVKN 

AAAAACTTTTCGGACATCAAGAAAGAGTTGAATGCAAAACTTGGAAATTT CAATAACA AT 
863 KNFSD I KKELNAKLGNF |N N N 

CAAGCAGCTAGCCTTGA AGAACCCATTTACGCT CAAGTTGCTAAAAAGGTAAATGCAAAA 
903 Q A A S L E IE P 1 Y "XI QVAKKVNAK 

CCTTTGAAAAGGCATGATAAAGTTGATGATCTCAGTAAGGT AGGGCTTTCAAGGAATCAA 
943 PLKRHDKVDDLSKVI G L S R N Q 

TTTGGCAATCTAGAGCAAACGATAGACAAGCTCAAAGATTCTACAAAACACAATCCCATG 
983 FGNLEQTI DKLKDSTKHNPM 

TACGCTACTAACAGCCACATACGCATTAATAGCAATATCAAAAATGGAGCAATCAATGAA 

FIG. 4C 
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NGVSHLEVGFNKVAIFNLPD 
AAA6GATT6TCCCCACAAGAAGCTAATAA6CTTATCAAAGATTTTTTGAGCA6CAACAAA 2280 
KGLSPQEANKLIKDFLSSNK 
AATTATGATGAAGTGAAAAAAGCTCAGAAAGATCTTGAAAAATCTCTAAGGAAACGAGAG 2400 
NYDEVK KAQKDLEKSLRKRE 
GAAGCAAAAGCTCAAGCTAACAGCCAAAAAGATGAGATTTTTGCGTTGATCAATAAAGAG 2520 
EAKAQANSGKDEI FAL I NKE 
TTGTCTGATAAACTTGAAAATGTCAACAAGAATTTGAAAGACTTTGATAAATCTTTTGAT 2640 
LSDKLENVNKNIKDFDKSFD 
AAAGGTTCGGTGAAAGATTTAGGTATCAATCCAGAATGGATTTCAAAAGTTGAAAACCTT 2760 
KGSVKDL G I NPEW I SKVENL 
GCAAAAAGCGACCTTGAAAATTCCGTTAAAGATGTGATCATCAATCAAAAGGTAACGGAT 2880 
AKSDLENSVKDVI INQKVTD 
GTAGAGCAAGCGTTAGCCGATCTCAAAAATTTCTCAAAGGAGCAATTGGCCCAACAAGCT 3000 
VEQALADLKNFSKEQL A y 0 Q A 
GGTGTGAATGGAACCCTAGTCGGTAATGGGTTATCTCAAGCAGAAGCCACAACTCTTTCT 3120 
GVNGTLVGNGLSQAEATTLS 
AACAATAA TGGACTCAAAAA CGAACCCATTTATGC TAAAGTTAATAAAAAGAAAGCAGGG 3240 
N N Nl G L K N IE P I Y A] KVNKKKAG 



ATTGACCGACTCAATCAAATAGCAAGTGGTTTGGGTGTTGTAGGGCAAGCAGCGGGCTTC 3360 

I DRLNQI ASGLGVVGQAAG fF 

GA AnGGCTCAGAAAATTG ACAATCTCAATCAAGCGGTATCAGA AGCTA AAGCAGGTTTT 3 480 

ELAQKIDNLNQAVSEAKAGF 

AATCTATGGGTTGAAAGTGCAAAAAAAGTACCTGCTAGTTTGTCAGCGAAACTAGACAAT 3600 

NLWVESAKKVPASISAKLDN 

AAAGCGACCGGCATGCTAACGCAAAAAAACCCTGAGTGGCTCAAGCTCGTGAATGATAAG 3720 
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1023 YATNSH I R I NSN I KNGA I N 

ATAGTT6CGCATAAT6TAGGAAGCGTTCCTTTGTCA6A6TATGATAAAATTGGCTTC 
1063 IVAHNV GSVPLSEYDKIGF 

GTAAAAGACACTAATTCTGGCTTTACGCAATTTTTAACCAATGCATTTTCTACAGCA 
1103 VKDTNSGFTQFLTNAFSTA 

GGTTTCCAAAAATCTTAAAGGATTAAGGAATACCAAAAACGCAAAAACC ACCCCTTG 
1143 G F Q K S 

TGAATGCTACCAATTCATGGTATCATATCCCCATACATTCGTATCTAGCGTAGGAAG 

AACTCTGTAAAATCCCTATTATAGGGACACAGAGTGAGAACCAAACTCTCCCTACGG 

GACAGACACTAACGAAAGGCTTTGTTCTTTAAAGTCTGCATGGATATTTCCTACCCC 

CGAAAATTAATTAAGGGTTATAAAGAGAGCATAAACTAGAAAAAACAAGTAGCTATA 

GAAAAATCAGAAAAACCATAGGAATTATCACACCTTATAATGCCCAAAAAAGACGCT 

ATGCCTTTCAAGGTGAAGAGGCAGATATTATTATTTATTCCACCGTGAAAACTTGTG 

ATCTCATTTTTGTGGGTAAAAAGTCTTTCTTTGAGAATTTATGAAGCGATGAGAAGA 

CATTCTTCGCTTCAAAACGCTTTCATAAATCTCTCTAAAGCGCTTTATAATCAACAC 

TTATTAGCGTTACAATTTGAGCCATTCTTTAGCTTGTTTTTCTAGCCAGATCACATC 

CTGCAAATATCCTACAATAGCATCGCCCGAATGGATGAGTAGGGGGGGTGTTGAAAG 

TAAAATAATCACTTCGG6AAAATCTTTAAGGGAGTGAAATAATAACGCATGCAAGTT 

TGCGAAACATTCAAATAGCCTTGTTGTTTCAGGGCATTGTCATAAGCGTTGGATTGG 

GCTAAAATGCTTGGCTCAATCACGCCCACAATAGGGATTTTGGAATGCTTTTGCATC 

TTGA AAAAA TCCAAAGCCTCTAAGCCAAATTGCTTGATCGTAGTGGGGTCTTTAGTG 

AGGCTTTTTAAAACGCTAAACCCTCCCACACCGCTATCAAAAACGCCTATTTTCATG 

TCTTCATTGTCCTTAGTTTGTTGCATTTTAGAATAGACAAAGCTT 5925 

FIG. 4E 
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EKATGMLTQKNPEWLKLVNDK 

AACCAGAAGAATATGAAA6ATTATTCT6ATTC6TTCAAGTTTTCCACCAA6TT6AACAATGCT 3840 
NQKNMKDYSDSFKFSTKLNNA 

TCTTATTACTGCTTGGCGAGAGAAAATGCGGAGCATGGAATCAAGAACGTTAATACAAAAGGT 3960 
SY YCLARENAEHGI KNVNTKG 

£IAAAA££gAGGGGTTTTTTAATACTCCTTAGCAGAAATCCCAATCGTCTTTAGTATTTGGGA 4080 

TGTGCAAAGTTACGCCTTTGGAGATATGATGTGTGAGACCTGTAGGGAATGCGTTGGAGCTCA 4200 

GCAACATCAGCCTAGGAAGCCCAATCGTCTTTAGCGGTTGGGCACTTCACCTTAAAATATCCC 4320 

AAAAAGACTTAACCCTTTGCTTAAAATTAAGTTTGATTGTGCTAGTGGGTTCGTGCTATAGTG 4440 

ACAAAGATCAAGTTCAAAAAATCATAGAGCTTTTAGAGCAAATTGATCGCGCTCTTAACCAAA 4560 

TGCGATCAGAAGTGGAAAAATACGGCTTCAAGAATTTTGATGAGCTCAAAATAGACACTGTGG 4680 

GTAATCTTTCTTTCTTGCTAGATTCTAAACGCTTGAATGTGGCTATTTCTAGGGCAAAAGAAA 4800 

ATATCTTTAGCGCTATTTTGCAAGTCTGTAGATAGGTAATCTTTTCCAAAGATAATCATTAGA 4920 

AATACCCTTATAGTGTGAGCTATAGCCCCTTTTTGGGAATTGAGTTATTTTGACTTTAAATTT 5040 

GCCGCTCGCATGAAATTCCACTTTAGGGAATGCGTGTGCATTTTTTTTAAGGGCGTATTTTTG 5160 

GGCAAAATGCTCCATAAAATAGCCCTCAATTTTTTGAGCGATTAAGGGAAAATGCGTGCAACC 5280 

TCTAACAATTCGCCCTCTAAAATACTTTCTTCAATCAAAGGCACAAAAAGAGAAGTGGCTAAA 5400 

ATCGTCGCTTTTGTCCCTAGCACTAAAATAGGGGCGTTTTTATCTTTTACTTGTCGCTTGATC 5520 

TCTTCTAAAGCTAGAGCGCTCGCTGTGTTGCATGCCACAATCAATAATTCAATCTGGTGCGGT 5640 

CCATAAGGCACTCTAGCCGTATCGCCATAATAGATGATTTCATCAAATAATTGCGCTTTTAAA 5760 

ACACTTTTTTAATTTAATGGGATTAATTAGGGATTTTATTTTTCATTCATTAAGTTTAAAAAT 5880 
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10 30 50 

AAGCTT6CTGTCAT6ATCACAAAAAACACTAAAAAACATTATTATTAAGGATACAAAATG 

M 

70 90 110 

GCAAAAGAAATCAAATTTTCA6ATAGTGCGAGAAACCTTTTATTTGAAGGCGTGAGGCAA 
AKEIKFSDSARNLLFEGVRQ 

130 150 170 

CTCCATGACGCTGTCAAAGTAACCATGGGGCCAAGAGGCAGGAATGTATTGATCCAAAAA 
LHDAVKVTMGPRGRNVLIQK 

190 210 230 

AGCTATGGCGCTCCAAGCATCACCAAAGACGGCGTGAGCGTGGCTAAAGAGATTGAATTA 
S YGAPS I TKDGVSVAKE I EL 

250 270 290 

AGTTGCCCAGTAGCTAACATGGGCGCTCAACTCGTTAAAGAAGTAGCGAGCAAAACCGCT 
SC.PVANM6AQLVKEVASKTA 

310 330 350 

GATGCTGCCGGCGATGGCACGACCACAGCGACCGTGCTAGCTTATAGCATTTTTAAAGAA 
DAAGDGTTTATVLAYSIFKE 
370 390 410 

GGTTTGAGGAATATCACGGCTGGGGCTAACCCTATTGAAGTGAAACGAGGCATGGATAAA 
GLRNITAGANPIEVKRGMDK 

130 450 170 

GCTGCTGAAGCGATCATTAATGAGCTTAAAAAAGCGAGCAAAAAAGTAGGCGGTAAAGAA 
AAEA I I NELKKASKKVGGKE 

190 510 530 

GAAATCACCCAAGTGGCGACCATTTCTGCAAACTCCGATCACAATATCGGGAAACTCATC 
E ITQVAT I SANSDHN I GKLI 

550 570 590 

GCTGACGCTATGGAAAAAGTGGGTAAAGACGGCGTGATCACCGTTGAGGAAGCTAAGGGC 
ADAMEKVGKDGVI TVEEAKG 

610 630 650 

ATTGAAGATGAATTGGATGTCGTAGAAGGCATGCAATTTGATAGAGGCTACCTCTCCCCT 
IEDELDVVEGMQFDRGYLSP 
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670 690 710 

TATTTT6TAACGAACGCT6A6AAAATGACC6CTCAATTGGATAATGCTTACATCCTTTTA 
YFVTNAEKMTAQLDNAYILL 

730 750 770 

ACGGATAAAAAAATCTCTAGCATGAAAGACATTCTCCCGCTACTAGAAAAAACCATGAAA 
TDKKISSMKDILPLLEKTMK 

790 810 Hindi 1 1 

GAGGGCAAACCGCTTTTAATCATCGCTGAAGACATTGAGGGCG AAGCTT TAACGACTCTA 
EGKPLLI IAEDIEGEALTTL 

850 870 890 

GTGGTGAATAAATTAAGAGGCGTGTTGAATATCGCAGCGGTTAAAGCTCCAGGCTTTGGG 
VVNKLRGVLNIAAVKAP6FG 

910 930 950 

GACAGAAGAAAAGAAATGCTCAAAGACATCGCTATTTTAACCGGCGGTCAAGTCATTAGC 
DRR K EMLKD I A I LTGGQVI S 

970 990 1010 

GAAGAATTGGGCTTGAGTCTAGAAAACGCTGAAGTGGAGTTTTTAGGCAAAGCTGGAAGG 
EELGLSL ENAEVEFLGKAGR 

1030 1050 1070 

ATTGTGATTGACAAAGACAACACCACGATCGTAGATGGCAAAGGCCATAGCGATGATGTT 
I V I DKDNTT IVDGKGHSDDV 

1090 1110 1130 

AAAGACAGAGTCGCGCAGATCAAAACCCAAATTGCAAGTACGACAAGCGATTATGACAAA 
KDRVAQIKTOIASTTSDYDK 

1150 1170 1190 

GAAAAATTGCAAGAAAGATTGGCTAAACTCTCTGGCGGTGTGGCTGTGATTAAAGTGGGC 
EKLQERLAKLSGGVAVIKV G 

1210 1230 1250 

GCTGCGAGTGAAGTGGAAATGAAAGAGAAAAAAGACCGGGTGGATGACGCGTTGAGCGCG 
AASEVEMKEKKDRVDDALSA 

1270 1290 1310 

ACTAAAGCGGCGGTTGAAGAAGGCATTGTGATTGGTGGCGGTGCGGCTCTCATTCGCGCG 
TKAAVEEG I VIGGGAALIRA 
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1330 1350 1370 

6CTCAAAAA6T6CATTTGAATTTGCACGATGAT6AAAAAGTG66CTATGAAATCATCATG 
AQKVHLNLHDDEKVGYEI IM 

1390 1410 1430 

CGCGCCATTAAAGCCCCATTAGCTCAAATCGCTATCAACGCTGGTTATGATGGCGGTGTG 
RA I KAPLAQIA I NAGYDGGV 

1450 1470 1490 

GTCGTGAATGAAGTAGAAAAACACGAAGGGCATTTTGGTTTTAACGCTAGCAATGGCAAG 
VVNEVEKHEGHFGFNASNGK 

1510 1530 1550 

TATGTGGATATGTTTAAAGAAGGCATTATTGACCCGTTAAAAGTAGAAAGGATCGCTCTA 
YVDMFK EGI IDPLKVERIAL 

1570 1590 1610 

CAAAATGCGGTTTCGGTTTCAAGCCTGCTTTTAACCACAGAAGCCACCGTGCATGAAATC 
QNAVSVSSLLLTTEATVHEI 

1630 1650 1670 

AAAGAAGAAAAAGCGACTCCGGCAATGCCTGATATGGGTGGCATGGGCGGTATGGGAGGC 
KEEKATPAMPDMGGMGGMGG 

1690 1710 1730 

ATGGGCGGCATGATGTAAGCCCGCTTGCTTTTTAGTATAATCTGCTTTTAAAATCCCTTC 
M G G M M • 

1750 1770 1790 

TCTAAATCCCCCCCTTTCTAAAATCTCTTTTTTGGGGGGGTGCTTTGATAAAACCGCTCG 

1810 1830 
CTTGTAAAAACATGCAACAAAAAATCTCTGTTAAGCTT 
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